Search results for "Simple random sample"

showing 10 items of 10 documents

Optimal selection of individuals for repeated covariate measurements in follow-up studies

2016

Repeated covariate measurements bring important information on the time-varying risk factors in long epidemiological follow-up studies. However, due to budget limitations, it may be possible to carry out the repeated measurements only for a subset of the cohort. We study cost-efficient alternatives for the simple random sampling in the selection of the individuals to be remeasured. The proposed selection criteria are based on forms of the D-optimality. The selection methods are compared with the simulation studies and illustrated with the data from the East–West study carried out in Finland from 1959 to 1999. The results indicate that cost savings can be achieved if the selection is focuse…

AdultStatistics and ProbabilityTime Factorsdata collectionEpidemiologyComputer sciencemissing covariate data01 natural sciences010104 statistics & probability03 medical and health sciences0302 clinical medicineHealth Information ManagementRisk FactorsStatisticsCovariateEconometricsHumans030212 general & internal medicineoptimal design0101 mathematicsrepeated measurementsFinlandSelection (genetic algorithm)Event (probability theory)ta112Data collectionPatient SelectionFollow up studiesta3142follow-up studyMiddle AgedSimple random sampleCardiovascular DiseasesResearch DesignCohortseurantatutkimusSelection methodFollow-Up StudiesStatistical Methods in Medical Research
researchProduct

A Procedure for Selecting Representative Subsamples of a Population from a Simple Random Sample

2015

This paper proposes a procedure for selecting large subsamples drawn from a large simple random sample that are more representative of the population under study. By means of the so-called constant of proportionality, the procedure seeks to maximize the size of the subsample taken from a stratified random sample with proportional allocation, restricting it to a p-value high enough to achieve a good fit using Pearson’s chi-square goodness of fit test. The user has the freedom to choose between a larger subsample with poorer adjustment or a smaller subsample with a better fit. We use the Continuous Sample of Working Lives (CSWL), a set of micro data taken from Spanish Social Security records,…

Engineeringeducation.field_of_studybusiness.industryPopulationSample (statistics)Simple random sampleRepresentativeness heuristicStratified samplingGoodness of fitStatisticsEconometricsChi-square testp-valuebusinesseducationSSRN Electronic Journal
researchProduct

Los emprendedores surgidos de las empresas multinacionales de inversión extranjera directa: un estudio exploratorio en Costa Rica

2014

ResumenEl presente trabajo busca evaluar la creación de empresas por parte de exempleados de empresas multinacionales de inversión extranjera directa. En concreto, se busca dimensionar el fenómeno, caracterizarlo, así como valorar el desempeño de las empresas creadas. El estudio se hizo mediante un muestreo aleatorio simple con margen de error del 7% y nivel de confianza del 95%, sobre una base de datos de 11.120 exempleados de empresas multinacionales en Costa Rica (n=175). Además se utilizó un grupo control ad hoc. Los resultados muestran cómo son estos emprendedores, el proceso creador experimentado, las características y el desempeño de las nuevas empresas.AbstractThe aim of this invest…

Facultad de Ciencias Administrativas y EconómicasEconomics and EconometricsInvestimento estrangeiro directoEmpresas multinacionaisEstudios GerencialesL26Strategy and ManagementMargin of errorForeign direct investmentlcsh:BusinessManagement of Technology and InnovationCriação de empresasProducción intelectual registrada - Universidad IcesiMultinational corporationsBusiness and International ManagementCreación de empresasMarketingWelfare economicsEntrepreneurshipSimple random sampleEmprendedoresEconomyInversión extranjera directaBusinessF23lcsh:HF5001-6182Foreign direct investmentFinanceEmpresas multinacionalesInversiones extranjeras directasEstudios Gerenciales
researchProduct

Improving the Representativeness of a Simple Random Sample: An Optimization Model and Its Application to the Continuous Sample of Working Lives

2020

This paper proposes an optimization model for selecting a larger subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is, therefore, NP-hard. However, the solution is found by maximizing the size of the subsample taken from a stratified random sample with proportional allocation and restricting it to a p-value large enough to achieve a good fit to the population of interest using Pearson&rsquo

General MathematicsPopulation0211 other engineering and technologiessubsamplingSample (statistics)02 engineering and technologyRepresentativeness heuristic:CIENCIAS ECONÓMICAS [UNESCO]Nonlinear programming0502 economics and businessStatisticsComputer Science (miscellaneous)Chi-square testchi-square testp-value050207 economicseducationEngineering (miscellaneous)Mathematicseducation.field_of_study021103 operations researchlcsh:Mathematics05 social sciencesUNESCO::CIENCIAS ECONÓMICASp-valueSimple random samplelcsh:QA1-939Stratified samplingOptimización matemáticacontinuous sample of working livesEconomía públicaoptimizationMathematics
researchProduct

Dynamic Phase Diagram of the REM

2019

International audience; By studying the two-time overlap correlation function, we give a comprehensive analysis of the phase diagram of the Random Hopping Dynamics of the Random Energy Model (REM) on time-scales that are exponential in the volume. These results are derived from the convergence properties of the clock process associated to the dynamics and fine properties of the simple random walk in the $n$-dimensional discrete cube.

Physicsrandom environmentsspin glassesRandom energy model010102 general mathematicsagingrandom dynamicsSimple random sample01 natural sciencesLévy processclock processExponential function[MATH.MATH-PR]Mathematics [math]/Probability [math.PR]010104 statistics & probabilityCorrelation functionLévy processesConvergence (routing)Statistical physics0101 mathematicsCube[MATH]Mathematics [math]Phase diagram
researchProduct

Selection of Large Sub-Samples from the Continuous Sample of Working Lives Representative of the Benefits Provided by the Spanish Public Pension Syst…

2016

The Continuous Sample of Working Lives (CSWL) is a set of anonymized microdata with information about individuals taken from Spanish Social Security records. It provides very valuable information, which is used in many studies on labor economics and in the analysis of the Spanish public pension system. This article presents two major contributions: The first is an analysis of how representative CSWL is of the population of pensioners for the period 2005-2013. It is concluded that the CSWL does not follow the same distribution as the population with respect to some types of benefits, and that this happens in most waves. One of the reasons is that it is obtained by simple random sampling, so …

Social securityeducation.field_of_studyPensionComputer sciencePopulationSampling designEconometricsMicrodata (statistics)educationSimple random sampleRepresentativeness heuristicStratified samplingSSRN Electronic Journal
researchProduct

Horvitz-Thompson estimators for functional data: asymptotic confidence bands and optimal allocation for stratified sampling

2009

When dealing with very large datasets of functional data, survey sampling approaches are useful in order to obtain estimators of simple functional quantities, without being obliged to store all the data. We propose here a Horvitz--Thompson estimator of the mean trajectory. In the context of a superpopulation framework, we prove under mild regularity conditions that we obtain uniformly consistent estimators of the mean function and of its variance function. With additional assumptions on the sampling design we state a functional Central Limit Theorem and deduce asymptotic confidence bands. Stratified sampling is studied in detail, and we also obtain a functional version of the usual optimal …

Statistics and ProbabilityFOS: Computer and information sciencesApplied MathematicsGeneral MathematicsEstimatorSurvey samplingSimple random sampleAgricultural and Biological Sciences (miscellaneous)Statistics - ApplicationsStratified samplingMethodology (stat.ME)Sampling designStatisticsCluster samplingApplications (stat.AP)Statistics Probability and UncertaintyGeneral Agricultural and Biological SciencesBootstrapping (statistics)Statistics - MethodologyMathematicsVariance function
researchProduct

A Bayesian comparison of cluster, strata, and random samples

1999

When sampling from finite populations, simple random sampling (SRS) is rarely used in practice, due to either high cost or information to be gained from more efficient designs. Bayesian hierarchical models are a natural framework to model the non-randomness in the sample. This paper concentrates on the effects that the design has on inference about characteristics of the finite population, and makes a critical comparison among some common designs.

Statistics and Probabilityeducation.field_of_studyApplied MathematicsBayesian probabilityPopulationSampling (statistics)Sample (statistics)Simple random sampleStratified samplingsymbols.namesakeStatisticssymbolsCluster samplingStatistics Probability and UncertaintyeducationMathematicsGibbs samplingJournal of Statistical Planning and Inference
researchProduct

Using Complex Surveys to Estimate theL1-Median of a Functional Variable: Application to Electricity Load Curves

2012

Mean proles are widely used as indicators of the electricity consumption habits of customers. Currently, Electricit e De France (EDF), estimates class load proles by using point-wise mean function. Unfortunately, it is well known that the mean is highly sensitive to the presence of outliers, such as one or more consumers with unusually high-levels of consumption. In this paper, we propose an alternative to the mean prole: the L1-median prole which is more robust. When dealing with large datasets of functional data (load curves for example), survey sampling approaches are useful for estimating the median prole and avoid storing all of the data. We propose here estimators of the median trajec…

Statistics and Probabilityeducation.field_of_studyComputer sciencePopulationEstimatorSurvey samplingSampling (statistics)Simple random sampleStratified samplingHorvitz–Thompson estimatorOutlierStatisticsStatistics Probability and UncertaintyeducationInternational Statistical Review
researchProduct

The Tax Justice Network-Africa v Cabinet Secretary for National Treasury & 2 Others: A Big Win for Tax Justice Activism?

2019

This paper develops an optimization model for selecting a large subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is therefore NP-hard. However, the solution is found by maximizing the “constant of proportionality” – in other words, maximizing the size of the subsample taken from a stratified random sample with proportional allocation – and restricting it to a p-value high enough to achieve a good fit to the population of interest using Pearson’s chi-square goodness-of-fit test. The beauty of the m…

education.field_of_studyPopulationStatisticsChi-square testSample (statistics)p-valueeducationSimple random sampleRepresentativeness heuristicStratified samplingMathematicsNonlinear programmingSSRN Electronic Journal
researchProduct